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0.   INTRODUCTION 

The  errors  in  variables  model  played  a  large  role  in  the  early 
development  of  econometrics.   The  instrumental  variable  method  of  estimation 
was  developed  to  deal  explicitly  with  the  problem  of  a  "right  hand  side" 
variable  in  a  regression  model  being  non-orthogonal  to  the  "error"  in  the 
regression  specification.   However,  the  outstanding  problem  with  instrumental 
variables  estimators  has  always  been  to  specify  where  the  candidates  to 
form  instruments  come  from.   In  the  usual  simultaneous  equation  specifica- 
tions this  problem  does  not  exist;  the  included  endogenous  variables  in  an 
equation  are  non-orthogonal  to  the  structural  disturbance^ but  the  excluded 
predetermined  variables  always  yield  enough  additional  candidates  to  form 
instruments  to  permit  estimation.   This  result  follows  from  identification 
of  the  structural  model  under  the  usual  assumptions.   The  single  equation 
errors  in  variables  specification  does  not  have  these  convenient  instruments 
to  use.   Recently  Zellner  [9]  and  Goldberger  [4]  have  attempted  to  surmount 
this  problem  by  making  the  unobserved  exogenous  variable  a  (stochastic) 
linear  function  of  observable  variables.   This  procedure  provides  a  partial 
solution  to  the  problem  although  in  many  cases  the  specification  of  the  linear 
relationship  may  be  very  difficult. 

In  this  paper  I  investigate  the  simultaneous  equation  estimation 
problem  when  errors  in  variables  exist  in  the  exogenous  variables.   That  is, 
the  true  structural  specification  contains  an  unobservable  variable.   A 
"proxy  variable"  for  this  variable  is  observed.   The  estimation  problem  is 
to  incorporate  additional  information  to  permit  estimation.   If  the  proxy 
variable  is  used  in  place  of  the  true,  unobservable  variable.  Inconsistent 
estimates  result.   Instrumental  variables  may  be  applied  to  this  situation 
with  resulting  consistent  estimates.   Instead,  I  make  a  distributional 
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assumption  on  the  exogenous  variables  and  convert  the  system  back  into 
structural  form.   Efficient  estimators  are  then  proposed  for  the  system. 
Depending  on  the  stochastic  assumptions  made  about  the  original  structural 
disturbances  and  the  error  of  observation,  different  estimators  result. 
While  a  full- information  maximum  likelihood  estimator  (FIML)  may  be  used 
in  all  cases,  under  one  set  of  stochastic  assumptions  three  stage  least 
squares  (3SLS)  is  seen  to  be  asymptotically  efficient.   In  the  two  other 
cases  3SLS  is  only  consistent  so  FIML  or  a  closely  related  estimator  must 
be  used  to  achieve  efficiency.   It  turns  out  that  the  inclusion  of  errors 
in  variables  leads  to  zero  restrictions  on  the  covariance  matrix  of  the  errors 
in  these  situations;  but,  otherwise,  the  standard  set-up  remains.   To  date 
little  work  has  been  done  on  the  standard  simultaneous  equation  model  with 
zero  restrictions  on  the  covariance  matrix  -  only  the  case  of  a  diagonal 
covariance  matrix  has  been  studied.   While  FIML  provides  an  asymptotically 
efficient  estimation  scheme,  alternative  efficient  estimators  would  be  de- 
sirable to  decrease  the  computational  requirements. 

In  Section  1  the  standard  simultaneous  equation  model  is  specified. 
Estimation  by  instrumental  variables  when  there  exist  errors  in  the 
exogenous  variables  is  discussed  in  Section  2.   Next  a  full- information 
approach  to  the  problem  is  studied  when  the  exogenous  variable  with  error 
appears  in  only  one  equation.   FIML  and  3SLS  are  both  efficient  in  the 
case  of  an  unrestricted  covariance  matrix,  but  only  FIML  is  efficient  in 
the  other  case  studied.   Lastly,  the  situation  is  generalized  to  the  case 
when  the  unobserved  exogenous  variable  enters  more  than  one  equation. 
This  situation  may  lead  to  the  problem  of  nonlinear  restrictions  on  the 
covariance  matrix  containing  the  structural  parameters.   More  work  on 
estimators  needs  to  be  done  in  this  last  area. 
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1.   SPECIFICATION  OF  THE  MODEL 

Consider  the  standard  linear  simultaneous  equation  model  containing 
no  lagged  endogenous  variables  where  all  identities  (assumed  not  to  contain 
unobserved  variables)  have  been  substituted  out  of  the  system  of  equations: 

(1.1)        YB  +  zr  =  U 

where  Y  is  the  T  x  M  matrix  of  jointly  dependent  variables,  Z  is  the 
T  X  M  matrix  of  exogenous  variables,  and  U  is  a  T  x  M  matrix  of  the 
structural  disturbances  of  the  system.   The  model  thus  has  M  equations  and 
T  observations.   The  usual  assumptions  are: 

A.l  All  equations  satisfy  the  rank  condition  for  identification. 

A.  2   B  is  nonsingular. 

A. 3  The  predetermined  variables  are  assumed  orthogonal  with 

t 
respect  to  the  disturbances,  E(Z  U)  =  0,  or  asymptotically 

-1  ' 

plim  T  Z  U  =  0. 

A. A  The  second  order  moment  matrices  of  the  current  predetermined 
and  the  endogenous  variables  are  assumed  to  have  nonsingular 
probability  limits. 

A. 5  The  structural  disturbances  are  distributed  as  a  nonsingular 
normal  distribution,  U~N(0,Z  (x) I  )  where  E  is  a  positive 
definite  matrix  or  rank  M  and  no  restrictions  are  placed  on  E. 

The  identifying  assumptions  will  take  the  form  of  exclusion  restrictions 

so  after  choice  of  a  normalization  let  r.  and  s.  be  the  number  of  included 

1      1 

jointly  dependent  and  predetermined  right  hand  side  variables,  respectively, 
in  the  i   equation.   Rewriting  the  system  of  equations  in  (1.1): 
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(1.2) 


^i  =  ^i*i  ■"   "i 


i  =  1,  2,  .  .  ,  M 


where  X.  =  [Y.Z.] 
1     i  1 


^1  = 


where  the  X  contains  the  r,  +  e.   variables  whose  coefficients  are  not 

known  a  priori  to  be  zero.   Therefore,  Y.  and  u,  are  T  vectors,  X.  is  a 

T  X  (r.  +  s.)  matrix,  and  6.  is  a  (r,  +  s.)  dimension  vector.   Then 
1    i  i        i    1 

stacking  the  M  equations  into  a  system: 


(1.3) 


y  =  X6  +  u 


where  y  = 


'M 


X  = 


X^       0 


^ 


M 


u  = 


u. 


"m 
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2.   INSTRUMENTAL  VARIABLE  APPROACH  TO  ERRORS  IN  VARIABLES 

The  tr.nditional  errors  in  variables  specification  (c.f.  Kendall  and 
Stuart  [6],  p.  378  or  Goldberger  [4],  p.  1)  is  to  consider  the  observation 
of  an  exogenous  variable  subject  to  error.   Let  us  consider  the  first 
column  of  the  Z  matrix  and  denote  the  unobservable,  "true"  column  z  as 
z   .   Instead  of  z   ,  we  observe  z^ ,  which  is  determined  by  the  true 
exogenous  variable  z   ,  according  to  the  relationship 


(2.1)  z^  =  z/  +  e^ 


whe 


re  the  components  of  e^  are  i.i.d.  with  zero  mean  and  constant  variance 


* 

o   .   Also  F   is  assumed  independent  of  Z  and  in  particular  z^  .   Consideration 

^1 
of  errors  of  this  type  in  the  endogenous  variables  is  not  interesting  since 

they  are  observationally  equivalent  to  the  errors  in  the  equations.   In 

fact,  the  traditional  approach  is  to  assume  an  exact  linear  relationship 

without  a  structural  disturbance,  and  the  error  in  observing  the  endogenous 

variable  provides  the  error  term  in  the  usual  regression  format  (see  Kendall 

and  Stuart  [6]). 

The  instrumental  variable  procedure  is  usually  a  single  equation 

approach;  consider  the  i   equation  assumed  to  contain  z^ . 


(2.2)  "^i  "  ^i  *  "^  "i 


where  X.   =  [Y.  z  Z.]  where  Z.  is  the  matrix  Z.  with  the  first  column 


-5- 


deleted.   Now  if  one  replaces  the  unobservable  z^  ,  by  the  observable 
z   then  equation  (2.2)  becomes 


(2.3)  y,  =  Y,6,  +  z,Y,i  +  Z,Y,  +  u,  -  E.y, 


where  Z,  and  Y.  are  the  matrix  Z.  and  vector  y,  with  the  first  column  and 
1      1  1  i 

element  removed,  respectively.   If  z   is  treated  as  an  exogenous  variable, 
inconsistent  estimates  result  since  z      is  not  independent  of  e  .   However, 
a  consistent  estimation  procedure  proposed  by  Chernoff  and  Rubin  [3]  and  by 
Sargan  [8]  is  to  treat  z   an  another  endogenous  variable.   Then  an  in- 
strumental variable  estimator  uses  instruments  W.  to  estimate: 

X 


(2.4)  6^  =  (w!^X^)  ^  W^y^  . 


The  requirements  on  the  instruments  are  that  they  be  distributed  independently 

of  the  error  v.  =  u.  -  e,Y.i,  and  they  be  correlated  with  X.: 
X    X    1  il         ■'  X 


(2.5)  plim  T  "'"Wj.v^  =  0         plim  T  ^^\   =  Q^ 


where  Q   is  a  finite  matrix  with  rank  (r  +  s.)  with  probability  one.   1 

Note  that  all  predetermined  variables  except  z^  are  candidates  to  form 

instruments;  at  least  r .  +  s .  must  be  used  to  insure  satisfaction  of  the 

i    X 

rank  condition.   Chernoff  and  Rubin  propose  a  LIML  type  procedure  which  Is 
asymptotically  equivalent  to  the  procedure  of  Sargan  which  uses  all  such 
predetermined  variables  except  z   to  form  the  instruments: 
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I  -  -,  »  ^  ~  I 


(2.6)  W^  =  X^Z(Z  Z)Z 


where  Z  denotes  the  Z  matrix  with  the  first  column  removed.   By  con- 
struction and  assumption  A. 3,  the  estimates  are  consistent  with  asymptotic 
covariance  matrix: 


(2.7)  Cov(6)  =  a^^(X^Z(Z  Z)  ■*■  Z  X^)  "*■ 


In  fact,  Sargan  shows  that  this  choice  of  instruments  is  best  asymptotically 

given  the  choice  of  all  candidates  to  form  instruments  from  Z  the  matrix 

of  predetermined  variables.   That  is,  any  choice  of  a  subset  of  Z  to  form 

instruments  will  have  a  covariance  matrix  at  least  as  large. 

Note  that  the  rank  condition  on  the  instruments  places  an  additional 

restriction  on  equation  i.   Q.  must  have  rank  r  +  s  which  requires  at 

least  r ,  +  s   variables  to  form  the  instruments  W  .   By  the  order  condition 

for  identification,  K  >  r.  +  s..   But  since  z^  cannot  be  used  to  form  in- 

-   1    i  1 

struments  we  have  only  K-1  candidates.   Thus,  if  equation  i  is  just  identified, 
K  =  r,  +  s  ,  then  the  instrumental  variable  procedure  cannot  be  used  since 
Q.  will  not  have  full  rank.   I  will  refer  to  identification  when  neglecting 
the  presence  of  exogenous  variables  measured  with  error  as  conditional 
identification.   Then  to  have  consistent  estimates  by  the  instrumental 
variable  technique,  equation  i  must  be  conditionally  overidentified.   Only  ) 
then  will  there  be  sufficient  variables  to  form  the  instruments  to  insure 
the  rank  condition  in  Q  . 
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■^ •   ™iik  . '  NFORMATION  ESTIMATION  OF  ERRORS  IN  VARIABLES 

Consider  the  structure  of  the  instrumental  variable  procedure.   The 
instruments  must  be  correlated  with  X,  and  uncorrelated  with  the  errors  in 
equation  1.   The  usual  approach  of  instrumental  variables  procedures 
used  in  the  simultaneous  equation  context,  such  as  2SLS,  uses  the  pre- 
determined variables  to  form  the  instruments.   This  procedure  is  natural 
since  by  assumption  A. 2  the  reduced  form  exists: 


(3.1)       Y  =  zn  +  V 


where  II  =  -TB   and  V  =  UB   .   Thus  the  elements  of  Y  are  certainly  correlated 
with  the  elements  of  Z.   Also  the  elements  of  Z  are  correlated  with  Z 
since  Z  contains  Z  .   Lastly,  Z  and  u,  are  uncorrelated  by  assumption  A. 3. 
Now  when  z   is  measured  with  error,  the  instrumental  variable  approach 
requires  that  z^  be  correlated  with  Z.  and  that  Z  be  independent  of  v  . 
Again,  this  approach  seems  reasonable  since  Z  is  independent  of  u  by 
assumption  A. 3,  and  Z  is  independent  of  e  by  assumption.   Note  that  the 
assumption  has  been  made  that  z^   is  not  orthogonal  to  all  the  other  exogenous 
variables.   This  assumption  seems  reasonable  although  it  must  be  admitted  it 
does  not  have  the  force  of  the  assumption  that  Y  is  correlated  with  Z  which 
comes  from  the  structure  of  the  reduced  form  in  equation  (3.1). 

Given  the  assumption  of  correlation  among  the  "true"  exogenous  variables 
we  now  make  an  additional  distributional  assumption  about  the  exogenous 
variables  which  embodies  the  correlation  relationship.   A  natural  assumption 
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is  to  consider  a  row  of  the.  Z  matrix,  Z ,  ,  and  assume  the  K  dimension  vector 
to  be  distributed  as  multivariate  normal: 


(3.2)  A.6:   z^  =  (z^^, . . . ,z^^) ~N(y ,A)  , 


For  many  cases,  especially  cross-section  data,  it  is  reasonable  to  assume 
that  all  z   for  t=l,...,T  are  distributed  identically  with  the  same  \i   and  A. 
For  time  series  specifications,  an  assumption  such  as  this  for  variables  with 
a  time  trend  removed  might  be  plausible.   Then  by  a  well-known  theorem  on 
multivariate  normal  distributions  (c.f.  Anderson  [1],  p.  28),  the  regression 
of  the  column  vector  z  on  the  rest  of  the  columns  of  Z,  say  Z  ,  gives  the 
conditional  expectation  of  z  ,  given  Z^ : 


(3.3)  z^  =  a  +  Z,a.  +  e„ 

1    o    1  1    2 


where  the  usual  independence  assumptions  of  the  regression  model  are 

satisfied  and  e„~N(0,a  I).   Since  in  our  model  z   is  the  unobservable 

A  ^ 

variable  z^  ,  and  we  observe  z  by  equation  (2.1),  the  unobservable  variable 

A 

z   satisfies  the  stochastic  relationship: 


A 

(3.4)  z,   =  a  +  Z,a,  +  e, 

1     o     11    i 


Reformulating  an  unobservable  exogenous  variable  as  a  stochastic  linear 
function  of  other  exogenous  variables  underlies  the  approach  recently 
advocated  by  Goldberger  [4]  in  the  non-simultaneous  equation  framework. 
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Given  assumption  A. 6  in  equation  (3.2)  or  straightforvardly  assuming 

equation  (3.4)  as  does  Goldberger,  where  he  assumes  e  ~N(0,o   I),  permits 

1  '2 

a  full-information  approach  to  the  problem. 

Consider  the  system  of  equations  which  describe  the  structural  relation- 

* 

ships  and  the  relationship  of  the  unobservable  variable  z   and  its  proxy 

z   (for  now  z   is  assumed  to  only  occur  in  equation  1;  later,  the  system 
will  be  generalized) : 


(3.5)  y^^  =  Y^ii^  +  z^   Y;l1  "^  ^1^1  "^  ""l 

^1  "  ^i^i  "^  ^i^i  ■*"  "i  i  =  2,...,M 


* 
(3.6)  z^  =  z^   +  E^ 


* 

z-   =  a  +  Za,  +  e„ 
1     o     1    2 


where  in  equation  (3.5)  Z   denotes  Z  ,  with  the  first  column  deleted,  and 
in  equation  (3.6)  Z  stands  for  the  complete  predetermined  variable  matrix  Z 
with  the  first  column  deleted.   By  direct  substitution  equation  (3.5)  and 
equation  (3.6)  are  combined  to  form  the  system  of  equations: 


(3. 7. a)        y^  =  Y^B^  +  Zay^^^  +  Z^y^   +  "l  "  ^11^2 


(3.7.b)        y,  =  Y.B.  +  Z.y.  +  u.  i  =  2,...,M 


Zellner  [9]  considers  the  case  where  in  equation  (3.4)  €2-0*   Thus  the 
unobservable  variable  is  known  as  an  exact  linear  relation  of  observable 
variables.   While  this  assumption  leads  to  an  interesting  statistical 
problem,  the  assumption  seems  so  strong  that  this  case  will  not  be  considered. 
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(3.7.C)        zj^  =  Za  +  e^  +  e 


2 


where  Z  is  now  the  matrix  Z  with  a  comumn  of  ones  added  to  include  the  con- 

a 
stant  if  it  is  not  already  present,  and  a  =  (    )  is  the  combined  vector 

for  the  unknown  coefficients  in  equation  (3.6).   Note  that  Z  contains 

* 

all  the  predetermined  variables  except  z^  ,  and  Z  contains  the  included 

predetermined  variables  from  equation  1  with  z^   removed.   By  assumption  A.l, 
equation  1  of  the  structural  system  is  conditionally  identified  so  Z 
contains  no  more  than  (K-r  -1)  predetermined  variables.   However,  just  as 
with  the  instrumental  variables  procedure  discussed  in  Section  2,  for  the 
full  information  method  to  be  applicable  equation  1  must  be  conditionally 
overidentif led.   Therefore  Z  must  contain  no  more  than  K-r  -2  predetermined 
variables. 

Now  consider  the  following  estimation  procedure  for  the  system  of 
equations  contained  in  equation  (3.7).   Rewriting  equation  (3.7.c)  as 


(3.8)  z^  =  Za  +  v^^^ 


where  v.,,,  =  e,  +  e„,  note  that  by  assumption  the  elements  of  v^,,_  are 
M+1    12  M+1 

distributed  independent  normally  with  zero  mean  and  constant  variance. 
Furthermore,  by  assumption  A. 3  and  equation  (2.1),  Z  and  v    are  con- 
temporaneously independent.   Therefore  a  consistent  estimator  of  a  is  the  least 
squares  regression: 


(3.9)  a  =  (Z  Z)  "^Z  z, 
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Given  a   one  can  form  a  consistent  estimate  of  z^  from  Za,  and  this 
conditional  predictor  may  be  substituted  into  equation  (3. 7. a).   Consistent 
estimation  of  equation  (3. 7. a)  for  6,  then  proceeds  in  the  usual  way  by 
instrumental  variables.   Note  that  this  procedure  is  exactly  the  instrumental 
variable  procedure  outlined  in  Section  2  since 


(3.10)         Za  =  Z(Z  Z)"-^  Z  z^ 


which  is  the  procedure  used  in  equation  (2.6)  to  form  the  instruments  W , . 
Thus  no  gain  in  efficiency  is  offered  by  the  stochastic  specification  of 
the  predetermined  variables  in  equation  (3.2)  which  lead  to  the  linear 
structural  relationship  of  equation  (3.4).   However,  some  structural  informa- 
tion has  been  neglected  so  now  consider  the  system  in  a  full  information 
context. 

The  equation  system  (3.7)  has  the  usual  format  of  a  structural 
simultaneous  equation  system.   The  errors  are  distributed  normally,  and 
denoting  the  errors  as  v.  for  i  =  1,...,M+1  equations. 


(3.11) 


To  explicitly  write  the  covariance  matrix  C  we  must  make  a  stochastic 
assumption  about  the  relationship  of  the  u.  and  e,  and  e,.   Two  assumptions, 
leading  to  two  different  estimators  are  made: 
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D.l:   C  is  assumed  unrestricted  so  that  the  u, 's  and  £  's  are  allowed 

to  be  correlated. 

D.2:   The  u.'s  and  c.'s  are  assumed  uncorrelated  so  in  the  last  row 
1        1 

and  column  of  C,  all  entries  are  zero  except  for  the  first  and  last. 
To  estimate  the  structural  system  (3.7)  under  assumption  D.l  of 
correlation  among  the  u's  and  the  e's  first  rewrite  it  in  stacked  form: 


(3.12) 


y  =  Z6  +  V 


where  now  there  are  M+1  equations  and  z^  is  treated  as  an  endogenous 

variable  and  6  contains  the  unknown  B. 's,  Yj's,  and  a.   Given  the  distribution 

of  V  in  equation  (3.11),  the  log  likelihood  function  is: 


(3.13) 


L(6,C)  =  K  -  y  log  det  (C)  +  T  log  det  \j\ 


y  [(y-Z6)'(C(^  I  )  ^(y-Z6)]. 


where  J  is  the  Jacobian  of  the  transformation  from  v  to  y.   Therefore  J 
is  now  an  M  X  1  square  matrix  which  has  the  form: 


(3.14) 


J  = 


I  y 


11 


J  0 

0---0  1 


where  B  is  the  M  square  matrix  of  the  original  endogenous  variables.   Since 
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1 
under  assumption  D.l  C  is  unrestricted,  it  has  the  form: 


(3.15)        C  - 


1 

j^lM 

^Ml-- 

1 

The  first  order  conditions  for  the  likelihood  functions  may  be  found  and 
an  algorithm  specified  (e.g.  Hausman  [5]).   However,  since  C  is  unrestricted, 
alternative  asymptotically  efficient  estimators  are  easily  specified.   For 
instance,  3SLS  applied  to  equation  (3.12)  will  produce  efficient  estimates. 
Alternatively,  a  full  information  instrumental  variable  estimator  might  be 
used.   Both  are  fully  efficient  as  proved  in  Hausman  [5].   As  I  show,  the 
latter  estimator  if  iterated  gives  the  FIML  estimates.   It  is  interesting  to 
note  that  if  3SLS  is  used,  the  last  equation  may  be  discarded  and  the 
original  system  (1.3)  used  where  z.  is  treated  as  an  endogenous  variable. 
Since  equation  (3.7.c)  is  just  identified,  Narayanan  [7]  shows  that  in 

the  sample  (and  of  course  asymptotically)  the  3SLS  estimates  remain  the 

2 
same  if  all  just-identified  equations  are  ignored.   Thus,  the  full- 
information  analogue  of  the  instrumental  variables  procedure  discussed  in 
Section  2  gives  efficient  estimates.   The  additional  structural  information 
of  equation  (3.4)  leads  to  no  gain  in  efficiency  since  a  just-identified 
equation  has  been  added  to  the  structural  system.  Only  if  some  of  the 


Note  that  the  component  elements  of  C,  e.g.  E,  a  ,   a      ,    etc.,  cannot  be 

^1   ^2 
separately  estimated  since  they  are  not  identified.   However,  the  structural 
parameters  are  identified  and  thus  consistent  estimation  is  possible. 

2 
Of  course,  more  efficient  estimates  of  a  are  obtained  by  estimating 

equation  (3.7.c)  jointly  with  the  other  equations.   Since  this  equation 

does  not  form  part  of  the  structural  system,  one  may  not  be  interested 

in  its  coefficients.   By  the  same  reasoning,  the  second  equation  in 

Goldberger  [4]  p.  7  equation  (3.8)  may  be  ignored  with  no  loss  in  efficiency. 
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exogenous  variables  are  known  a   priori  to  be  independent  of  z^  will  the 
added  distribution  assumption  of  A. 6  lead  to  more  efficient  estimates. 
For  them  some  of  the  a.  of  equation  (3.3)  are  known  a  priori  to  be  zero, 
and  equation  (3.7.c)  would  be  overidentif ied.   Then  3SLS  on  the  complete 
system  would  be  more  efficient  than  is  3SLS  when  the  M+1  st  equation  is 
neglected  or  treated  as  just-identified. 

However,  under  assumption  D.2  of  no  correlation  among  the  u.'s  and 
e.'s  the  situation  changes  markedly.   Since  by  assumption  A. 3  the  true 
exogenous  variable  Z  vectors  and  structural  error  u  vectors  are  assumed 
orthogonal,  e„  will  be  orthogonal  to  the  structural  errors  since  it  is  a 
linear  combination  of  z^   and  Z^  as  shown  in  aquation  (3.4).   Therefore  to 
determine  whether  stochastic  assumption  D.l  or  D.2  is  appropriate,  the 
relationship  of  the  error  of  observation  e^  to  the  u  's  and  to  e„  must  be 
determined.   The  assumption  that  the  error  of  observation  is  orthogonal  to 
the  u.'s  may  be  appropriate  in  many  cases  and  will  lead  to  a  restricted 
covarlance  matrix.   Of  course,  the  alternate  specifications  D.l  and  D.2 
can  be  tested  in  the  usual  manner  by  a  likelihood  ratio  test.   Under 
stochastic  assumption  D.2  the  covarlance  matrix  C  of  the  v's  now  has  zeroes 
in  the  last  row  and  column  except  in  the  first  and  last  place: 


(3.16) 


C  = 


^11  -^  ^11  ^e. 


-^ll^e. 


^11  'e, 
0 

0 


■(5T..0  I  a   +""0 

"l    ^2 


"Here  c^   and  £2  ^re  assumed  uncorrelated.   If  this  assumption  is  not  made, 

the  covarlance  matrix  C  in  equation  (3.16)  has  the  same  form  in  terms  of 

location  of  zeroes,  but  the  individual  terms  a       and  a       cannot  be  separated 

from  a  e„.  ^1      ^2 

£  2  -^       ^ 
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The  first  and  last  elements  in  the  last  row  and  column  of  C  put  no  restrictions 
on  C  beyond  the  usual  symmetry  restriction  since  they  are  "just-identified", 
but  C„.  for  i  =  2,...,M-1  are  now  zero.   The  likelihood  function  remains 
that  given  in  equation  (3.13),  but  now  the  simplification  to  3SLS  or  full- 
information  instrumental  variables  is  no  longer  possible  since  C  is  now 
restricted.   Thus  under  assumption  D.2,  3SLS  is  still  consistent  but  no 
longer  asymptotically  efficient.   Unless  the  complete  system  is  just- 
identified  it  still  is  better  asymptotically  than  the  single  equation 
instrumental  variable  estimator  discussed  in  Section  2. 

However,  a  significant  simplification  can  be  made  to  the  likelihood 
equation  (3.13).   By  taking  derivatives  with  respect  to  the  unknown  elements 
of  C,  setting  these  derivatives  equal  to  zero,  and  substituting  back  into 
(3.13)  the  "concentrated"  likelihood  function  is  obtained: 


(3.16)         L*(6)  =  K*  -  |-  log  det  (C)  +  T  log  det  |j 


To  maximize  this  function,  derivatives  are  taken  only  with  respect  to  the 
unknown  elements  in  C.   To  maximize  the  concentrated  likelihood  function, 
a  nonlinear  "hill-climbing"  algorithm  is  required.   An  algorithm  with 
guaranteed  convergence  which  has  desirable  second  order  properties  is  outlined 
in  Berndt,  Hall,  Hall,  and  Hausman  [2].   The  resulting  FIML  estimate  will 
be  asymptotically  efficient. 

An  interesting  question  arises  if  there  exists  an  "n-step"  estimator 

2 
(n  being  small)  of  the  minimum  x  type  which  is  asymptotically  efficient. 

One  such  estimator  is  given  in  Berndt,  et.al.   Starting  off  with  consistent 

estimates  by  3SLS  and  stopping  after  one  iteration  of  the  nonlinear  algorithm 

yields  efficient  estimates.   Only  evaluation  of  first  derivatives  of  equation 

(3.16)  are  required,  so  the  procedure  is  relatively  simple.   However, 
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we  might  hope  for  a  simpler  estimator  which  does  not  require  any  nonlinear 
derivatives  of  the  likelihood  function  in  equation  (3.16).   Since  the 
restrictions  on  C  given  by  equation  (3.16)  take  such  a  simple  form,  3SLS  or 
the  full  information  instrumental  variables  estimator  might  be  suitably 
altered  to  give  asymptotic  efficiency.   This  topic  requires  further 
investigation. 


Iteration  of  the  full- information  variables  estimator  seems  to  give  FIML 
where  the  procedure  outlined  in  Hausman  [5]  is  altered  to  set  the  known 
elements  of  C  equal  to  zero.   It  would  be  nice  to  further  simplify  this 
estimator  and  yet  retain  efficiency,  but  I  have  not  yet  been  able  to  do  so. 
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A.   FULL  INFORMATION  ESTIMATION  WHEN  THE  UNOBSERVED  VARIABLE  ENTERS  MORE  ' 
THAN  ONE  EQUATION 

So  far  only  the  only  situation  considered  has  been  when  the  unobservable 

* 
"true"  variable  z   enters  the  first  equation.   A  natural  generalization 

which  leads  to  an  interesting  estimation  problem  is  to  allow  the  unobservable 

variable  to  enter  additional  equations.   The  full  generality  of  the  situation 

is  conveyed  when  z   enters  the  first  two  equations  and  notational  problems 

are  minimized  so  only  that  particular  case  is  considered.   Thus  the  system 

* 
of  equations  in  (3.7)  is  altered  to  reflect  the  presence  of  z^   in  the  first 

two  equations: 


(4.1. a)        y^  =  Y^3^  +  ^ay-^-^  +   Z^y^^  "^  ^1  "  '^11^2 


(4.1.b)        72  =■  Y202  +  ^"^21  +  Z2Y2  "•"  "2  ~  ^21^2 


(4.1.C)        ^i  "  ^l^i  ■*■  ^i^i  "^  "i  i=3,...,M 


(A.l.d)        z.  =  Za  +  e  +  e„ 


where  Z  is  the  matrix  of  all  observable  exogenous  variables,  Z  and 
Z   contain  the  exogenous  variables  from  the  first  two  equations  with  z 
removed,  and  y-,  and  y„   are  the  unknown  parameters  corresponding  to  the 
exogenous  variables  in  Z  and  Z  .   The  system  is  again  stacked  in  the  form 


(4.2)         y  =  Z6  +  V 
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wlu<re  V  has   the  distribution: 


(A. 3) 


\ 


h 


-   Yj^i^^  ,..  N(0,C  X  I^) 


M+1 


"2  -  ^21^2 
?3 


This  system  of  equations  still  has  the  standard  form  of  a  simultaneous 
equation  system  with  the  likelihood  function  given  by  equation  (3.13). 
The  Jacobian  of  the  transformation  has  the  form: 


(A. 4) 


B,  Y 


'11 
21 


"(T-  -"0  J 


If  assumption  D.l  is  made  so  that  C  is  unrestricted,  the  analysis  of  Section  3 

holds  true.   So  long  as  the  second  equation  is  conditionally  overidentif ied, 

* 
treating  z.   as  endogenous  still  leaves  the  equation  identified.   FIML  can 

be  derived,  or  asymptotically  efficient  estimators  such  as  3SLS  or  a  full- 
information  instrumental  variable  estimator  offer  significant  computational 
simplification.   Again  equation  (A.l.d)  is  just-identified  so  that  it  may  be 

neglected  when  doing  3SLS  with  no  loss  of  efficiency.   Equation  (A.l.d)  need 

* 
only  be  considered  explicitly  if  it  is  known  a^  priori  that  z^   is  independent 

of  certain  of  the  exogenous  variables. 

As  before  the  situation  changes  markedly  when  distributional  assumption 
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D.2  of  no  correlation  among  the  u^'s  and  e.'s  is  made.   For  the  C  is 
restricted  and  has  the  form: 


(A. 5) 


C  = 


\ 


°12  •*■  ^11^21'^e 


I 


2j     / 


r 


I  \ii^ 


1  ^21^e 


Y-.-.0     YoiO 0~-0-a       +  a 

'11  c^        '21  e^  c^         e^ 


L 


This  covariance  matrix  has  an  even  more  complicated  structure  than  the 

covariance  matrix  of  equation  (3.16).   Not  only  are  there  zeros  present 

in  the  last  row  and  column  but  C,,,,  ,  and  C^.,  _  must  satisfy  the  nonlinear 

M+1 , 1      M+1 , 2 

restriction: 


(4.6) 


S+1.1  ^  'hi 

S+1,2   ^21 


Thus  again  3SLS  is  not  asymptotically  efficient,  but  FIML  must  be  used. 

The  nonlinear  restriction  of  equation  (4.6)  raises  an  additional  complication 

but  can  be  handled  in  a  straightforward  manner.   Again  beginning  with  a 

consistent  estimate  from  3SLS  (ignoring  the  nonlinear  and  zero  restrictions 

in  C),  one  iteration  of  the  previous  algorithm  will  give  asymptotic  efficiency. 

Thus  under  assumption  D.l  the  location  of  the  unobservable  variable  in  other 

equations  does  not  change  estimators.   However  under  assumption  D.2  the 


Note  that  if  equation  (4.1.b)  were  conditionally  just  identified,  identi- 
fication could  be  achieved  using  the  equality  of  equation  (4.6).   For  further 
details  of  identification  see  Geraci  [10]. 
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covarlance  matrix  of  the  errors  is  again  restricted  which  must  be  handled 
by  the  FIML  algorithm. 

5.   CONCLUSION 

In  this  paper  the  simultaneous  equation  model  has  been  enlarged  to 
permit  errors  in  the  exogenous  variables.   By  making  an  explicit  distri- 
butional assumption  on  the  relationship  of  the  exogenous  variables,  the 
errors  in  variable  problem  is  transformed  to  a  structural  specification. 
The  error-ridden  variable  is  then  treated  as  endogenous,  and  a  just- 
identified  structural  equation  is  added  to  the  model.   FIML,  3SLS,  and  full- 
information  instrumental  variable  procedures  are  all  efficient  estimators 
under  the  assumption  of  an  unrestricted  error  covarlance  matrix.   In  fact, 
the  additional  structural  equation  may  be  neglected  in  3SLS,  and  identical 
results  are  obtained  both  in  the  sample  and  asymptotically.   Thus  the  earlier 
instrumental  variable  approach  is  very  close  to  the  more  explicit  structural 
approach  of  this  paper.   The  structural  method  will  differ  only  when  it  is 
known  a   priori  that  the  unobservable  variable  is  distributed  independently 
of  some  of  the  other  exogenous  variables. 

The  other  case  considered  is  when  the  error  covarlance  matrix  is  not 
unrestricted.   Then  only  FIML  is  asymptotically  efficient.   3SLS  is  not 
efficient  because  it  does  not  treat  the  presence  of  zeros  in  the  covarlance 
matrix.   A  two-step  efficient  method  is  proposed,  but  further  work  might 
be  done  in  this  area  to  search  for  a  more  simple  efficient  estimator.   Lastly, 
the  case  where  the  unobservable  variable  enters  more  than  one  structural 
equation  generates  interesting  nonlinear  restrictions  on  the  covarlance  matrix. 
This  last  case  is  the  most  complicated  since  the  nonlinear  restrictions 
Involve  the  structural  parameters  (slopes)  of  the  system.   Thus  the  main 
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conclusion  of  this  paper  is  that  the  addition  of  errors  in  variables  to  the 
simultaneous  equation  model  may  lead  to  restrictions  on  the  structural 
covariance  matrix.   FIMT.  is  well  specified,  and  efficient  algorithms  exist 
to  maximize  the  likelihood  function.   I  intend  to  do  more  research  in 
finding  more  simple  computational  estimators  which,  like  3SLS  in  the 
standard  simultaneous  equation  case,  are  asymptotically  efficient  but  easier 
to  compute  than  FIML. 
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